Unified model for network dynamics exhibiting nonextensive statistics 
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We introduce a dynamical network model which unifies a number of network families which 
are individually known to exhibit g-exponential degree distributions. Fhe present model dynam- 
ics incorporates static (non-growing) self-organizing networks, preferentially growing networks, and 
(preferentially) rewiring networks. Further, it exhibits a natural random graph limit. The pro- 
posed model generalizes network dynamics to rewiring and growth modes which depend on internal 
topology as well as on a metric imposed by the space they are embedded in. In all of the networks 
emerging from the presented model we find g-exponential degree distributions over a large parame- 
ter space. We comment on the parameter dependence of the corresponding entropic index q for the 
degree distributions, and on the behavior of the clustering coefficients and neighboring connectivity 
distributions. 

PACS numbers: 05.70.Ln, 89.75.Hc, 89.75.-k 



I. INTRODUCTION 

Over the past two decades, nonextensive statistical mechanics has successfully addressed a wide spectrum of nonequi- 
librium phenomena in non-ergodic and other complex systems [l], Q • Recently, it has also entered the field of networks 
[3> BL (EM 0) B S 03 • Nonextensive statistical mechanics is a generalization of Boltzmann-Gibbs (BG) statistical 
mechanics. It is based on the entropy 

S q = 1 -Sdx\P^)] q (s x = Sbg ^-J dxp(x) Inp(x)) . (1) 

The extremization of the entropy S q under appropriate constraints [TTI | yields the stationary-state distribution. This 
is of the ^-exponential form, where the g-exponential function is defined as 

e* = [l + {l-q)x} 1 '^ , (2) 

for 1 + (1 — q)x > 0, and zero otherwise (with e\ = e x ). The tail exponent 7 = l/(q — 1) characterizes the asymptotic 
power-law distribution. 

Since the very beginning of the tremendous recent modeling efforts of complex networks it has been noticed that 
degree distributions asymptotically follow power-laws [llj], or even exactly ^-exponentials (l3| . The model in (l2| 
describes growing networks with a so-called preferential attachment rule, meaning that any new node i being added 
to the system links itself to an already existing node j in the network with a probability that is proportional to the 
degree kj of node j. In [131 ] this model was extended to also allow for preferential rewiring. The analytical solution 
to the model has a g-exponential as a result, with the nonextensivity parameter q being fixed uniquely by the model 
parameters. Recently in Q preferential attachment networks have been embedded in Euclidean space, where the 
attachment probability for a newly added node is not only proportional to the degrees of existing nodes, but also 
depends on the Euclidean distance between nodes. The model is realized by setting the linking probability of a 
new node to an existing node i to be pn n k oc ki/rf {a > 0), where rj is the distance between the new node and 
node i; a = corresponds to the model in [l2j which has no metrics. The analysis of the degree distributions of 
the resulting networks has exhibited Q ^-exponentials with a clear a-dependence of the nonextensivity parameter q. 
In the large a limit, q approaches unity, i.e., random networks are recovered in the Boltzmann-Gibbs limit. In an 
effort to understand the evolution of socio-economic networks, a model was proposed in Q that builds upon [l3[ but 
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introduces a rewiring scheme which depends on the internal network distance between two nodes, i.e., the number 
of steps needed to connect the two nodes. The emerging degree distributions have been subjected to a statistical 
analysis where the (null) hypothesis of q-exponentials could not be rejected. 

It has been found that networks exhibiting degree distributions compatible with g-exponentials are not at all limited 
to growing and preferentially organizing networks. A model for nongrowing networks which was recently put forward 
in [5J also unambiguously exhibits (/-exponential degree distributions. This model was motivated by interpreting 
networks as a certain type of 'gas' where upon an (inelastic) collision of two nodes, links get transfered in analogy to 
the energy-momentum transfer in real gases. In this model a fixed number of nodes in an (undirected) network can 
'merge', i.e., two nodes fuse into one single node, which keeps the union of links of the two original nodes; the link 
connecting the two nodes before the merger is removed. At the same time a new node is introduced to the system 



and is linked randomly to any of the existing nodes in the network [14J . Due to the nature of this model the number 
of links is not strictly conserved - which can be thought of as jumps between discrete states in some 'phase space'. 
The model has been further generalized to exhibit a distance dependence as in Q, however not being Euclidean 
but internal distance. Again, the resultin g d egree distributions have ^-exponential form. 

A quite different approach was taken in [151 ] where an ensemble interpretation of random networks has been adopted, 
motivated by superstatistics [16j |. Here it was assumed that the average connectivity k in random networks is fluc- 
tuating according to a distribution II(fc), which is sometimes associated with a 'hidden- variable' distribution. In this 
sense a network with any degree distribution can be seen as a 'superposition' of random networks with the degree 

distribution given by p(k) — J °° dX II(fc)^-|j — . It was shown in [TH ], as an exact example, that an asymptotically 
power-law functional form of II(fc) cx fc -7 leads to degree distributions of Zipf- Mandelbrot form, p(k) oc 



(fc0 + fc)T ' 

which is equivalent to a q-exponential e q k ^ K with k = (1 — q)ko and (7 = 1 + 1/7. Very recently a possible connection 
between small-world networks and the maximum 5 q -entropy principle, as well as to the hidden variable method [15| . 
has been noticed in [9(. 

In yet another view, networks have recently been treated as statistical systems on a Hamiltonian basis (l7l. [l8l. ITjjl |20| . 
It has been shown that these systems show a phase transition like behavior [l8l |. along which networks structure 
changes. In the low temperature phase one finds networks of 'star' type, meaning that a few nodes are extremely 
well connected resulting even in a discontinuous p(k); in the high temperature phase one finds random networks. 
Surprisingly, for a special type of Hamiltonians networks with q-exponential degree distributions emerge right in the 
vicinity of the transition point (20| . 

Given the above characteristics of networks and the fact that a vast number of real- world and model networks show 
asymptotic power-law degree distributions, it seems almost obvious to look for a deeper connection between networks 
and nonextensive statistical physics. The purpose of this work is to show that various model types can be unified into 
a single dynamic network-formation model, characterized by a reasonably small number of parameters. Within this 
parameter space, all networks seem to be compatible with (/-exponential degree distributions. 



II. MODEL 

The following model is a unification and generalization of the models presented in 0, Q • The model in Q captures 
preferential growing aspects of networks embedded into a metric space, while Q introduces a static, selforganizing 
model with a sensitivity to an internal metric (chemical distance, Diekstra distance). The rewiring scheme there can 
be thought of having preferential attachment aspects in one of its limits [l4[ (see below), but has none in the other 
limit. 



A. Network model 



The network evolves in time as described in [J]: At t = 1, the first node (i = 1) is placed at some arbitrary position 
in a metric space. The next node is placed isotropically on a sphere (in that space) of radius r, which is drawn from 
a distribution Pair) oc l/r aa (ag > 0, G stands for growth. To avoid problems with the singularity, we impose a 
cutoff at r m j n = 1. The second node is linked to the first. The third node is placed again isotropically on a sphere 
with random radius r G Pg, however the center of the sphere is now the barycenter of all the pre-existing nodes. 
From the third added node on, there is an ambiguity where the newly positioned node should link to. We choose a 
generalized preferential attachment process, meaning that the probability that the newly created node i attaches to 
a previously existing node j is proportional to the degree kj of the existing node j, and on the metric (Euclidean) 
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FIG. 1: Time evolution of the degree of the best connected node (a) and of a randomly chosen node (b) for the parameters, 
N = 10000, a A = 0, a M = 0. 



distance between i and j, denoted by r»j. In particular the linking probability is 

where N(t) is the number of nodes at time t. It is not necessary that at each time step only one node is entering the 
system, so we immediately generalize that a number of n nodes are produced and linked to the existing network with 
I links per time step. Note that n and I can also be random numbers from an arbitrary distribution. For simplicity 
and clarity we fix n = 1 and 1 = 1. 

After every A timesteps, a different action takes place on the network. At this timestep the network does not grow 
but a pair of nodes, say i and j, merge to form one single node (l4| . This node keeps the name of one of the original 
nodes, say for example i. This node now gains all the links of the other node j, resulting in a change of degree for 
node i according to 

ki — > hi + kj — N common , if are not first neighbors 

k% > ki + kj ^Vcommon 2 , if (z, j) are first neighbors (4) 

where N common is the number of nodes, which shared links to both of i and j before the merger. In the case that i and 
j were first neighbors before the merger, i.e., they had been previously linked, the removal of this link will be taken 
care of by the term —2 in Eq. The probability that two nodes i and j merge can be made distance dependent, as 
before. In particular to stay close to the model in Q, we chose the following procedure. We randomly choose node i 
with probability oc 1/N(t) and then choose the merging partner j with probability 

<r aM 

P% = ^=^ («m>0) , (5) 

where is the shortest distance (path) on the network connecting nodes i and j; Obviously, tuning aM from 
toward large values, switches the model from the case where j is picked fully at random (oc 1/N(t)), to a case where 
only nearest neighbors of i will have a nonnegligible chance to get chosen for the merger. Note that the number 
of nodes is reduced by one at that point. To keep the number of nodes constant at this timestep, a new node is 
introduced and linked with I of the existing nodes with probability given in Eq. ([3]) . 

This concludes the model. Summing up, the relevant model parameters, we have the merging exponent aM, the 
attachment exponent a a, controlling the sensitivity of 'distance' in the network, and the relative rate of merging and 
growing, A. The parameters, ag, n, I, and r m ; n have been found to play no major role in the model. 

We simulate this model and record the degrees fcj, the clustering coefficients Cj (defined below), and the nearest 
neighbor-connectivity fc™ n , for all individual nodes i. From these values we derive distribution functions (as a function 
of k). In Fig. H] typical degree distributions are shown for three typical values of A. Obviously, the distribution is 
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FIG. 2: Degree distribution P(k) (un-normalized) for N = 10000, ola = 0, olm = and various values of A. 



dominated by a power-law decay (see details of the functional form below) ending in an exponential finite size cut-off 
for large k. 

The clustering coefficient of node i, Ci is defined by 

2e« (6) 
kiyhi 1) 

with a being the number of triangles node i is part of. c(k) is obtained by averagin g oy er all Cj with a fixed k. 
It has been noted that c(k) contains information about hierarchies present in networks [2l|. For Erdos-Rcnyi (ER) 
networks [22j , as well as for pure preferential attachment algorithms without the possibility of rewiring, the clustering 
coefficient c(k) vs. degree is flat. The global clustering coefficient is the average over all nodes C — (cj)j. A large 
global clustering coefficient is often used for identification of small- world structure [23| . The average nearest-neighbor 
connectivity (of the neighbors) of node i is 

j neighbor of i 

When plotted as a function of k, k nn (k) is a measure to assess the assortativity of networks. A rising function means 
assortativity, which is the tendency for well connected nodes to link to other well connected ones, while a declining 
function signals disassortative structure. 



B. Particular instances of the model 



Depending on the variables of the model, known networks result as natural limits. 



1. Soares et al. limit 



For the lim A — > oo we have no merging, and «m is an irrelevant parameter. The model corresponding to this limit 
has been proposed and studied in 0]. 
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2. Albert- Barabasi limit 

The limA — » oo and lima^ — > 0, gets rid of the metric in the Soares et al. model and recovers the original 
Albert-Barabasi preferential attachment model. 

3. Kim et al. limits 



The limit lim A — > allows no preferential growing of the network. If at each timestep after every merger a new 
node is linked randomly with I links to the network, the model reported i n [a] is recovered. The lim A — > model with 
limofM - * (limaM —> oo) recovers the random case (neighbor case) in [141] . 



III. NONEXTENSIVE CHARACTERIZATION OF COMPLEX NETWORKS 



There has been a convincing body of evidence that, for a large class of networks, (normalized) degree distributions 
can be fit by g-exponentials, 



P(k) = e 



= p -{k-l)/ K 



(k = 1,2,3,4,.. 



(8) 



where the (/-exponential function is defined in Eq. ([2]), with q > 1, and k > some characteristic number of links. 
A convenient procedure to perform a two-parameter fit of this kind is to take the q-logarithm of the distribution P, 



defined by Z q (k) = ln q P(k) 



[Pik)] 1 -"-! 
1-q 



This is done for a series of different values of q. The function Z q (k) which 



can be best fit with a straight line determines the value of q, the slope being —k. 

In Fig. [3] we show the degree distribution for several system sizes together with the q- logarithm Z q (k), from which 
an optimum q and k can be obtained. We conclude that, with good precision, the Ansatz in Eq. ([8]) for the degree 
distribution, when seen as a null hypothesis, can not be rejected on the basis of a \ 2 statistics for any reasonable 
significance level, for the system sizes studied. 

For actual curve fitting, it is often more convenient to use the cumulative distributions, which can be parametrized 
by 



P(> k) = e-C*- 1 )/"' 



(k = 1,2,3,4,...) 



(9) 



On the other hand the corresponding cumulative distribution P(> k) is given by (we switch to integral notation for 
simplicity for a moment) 



P(>k) = l-J dk'P{k') = 



1 - 



-{k-1) 



2-q 
1-q 



By comparison of coefficients the cumulative parameters are given by 

1 . K 



2-q 



and 



<1 



(10) 



(11) 



Whenever we talk about q- values corresponding to a cumulative distribution, we use the notation q c and k c , where c 
indicates cumulative. 

The remarkable quality of g-exponential fits to the degree distributions from the model, reveals a connection Q of 
scale- free network dynamics to nonextensive statistical mechanics To make the point more clear, consider the 

entropy 



S q = 



_1-J 1 00 dk[p(k)}" 



9-1 



Si = Sbg — — 



dkp(k) lnp(fc) 



where we assume A; as a continuous variable for simplicity. If we extremize S q with the constraints [111 ] 

f™dkk\p{k)]« 



/ dkp(k) = 1 and ^ 



dk [p{k)]i 



= K. 



(12) 



(13) 
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FIG. 3: (a) P(k) for A = 2, a. a = 1, olm = 1, and various system sizes (symbols). The line is the g-exponential fit for N = 2000. 
(b) g-logarithm of the (normalized) P(k) from (a). The line associated with q — 1.375 corresponds to an optimal linear fit, i.e. 
a maximum of the correlation coefficient (inset) of a straight line with Z q . The quality of the fit in (a) is given by a standard 
X 2 statistics. 

we obtain 

-/3(fc-i) 

*>(*)= 7ST± -^-D ^P-gK^" ( 14 ) 

where /3 is determined through Eq. (|13| . Both positivity of p(k) and the normalization constraint (|13p impose q < 2. 

Let us mention that models do exist that can be handled analytically, and which exhibit precisely g-exponential 
degree distributions. Such is the case of [T^. The degree distribution is there presented in the form p(k) cx l/(fc + fc ) 7 . 
This form can be re- written as a ^-exponential with q = ^il = ^(^"^rj+i-p-r ' w ^ eie ( m jPi r ) are parameters of the 
particular model in [l3j . 



IV. RESULTS 



Realizing the above network model in numerical simulations we compute degree distributions, clustering coefficients, 
and neighbor connectivity, for a scan over the relevant parameter space, spanned by A, a a and cxm- All following 
data were obtained from averages over 100 identical network realizations with a final iV(i max ) = 1000; for finite size 
checks we have included runs with N(t m&K ) = 500 and 2000. The reason for these relatively modest network sizes is 
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that, at every timestep, all network distances have to evaluated. The remaining parameters have been checked to be 
of marginal importance and have been fixed to ctQ = 1, n = 1 and 1 = 1. 

Qc Kc X 2 

x io -3 K, 




FIG. 4: q c and « c values from q-exponential fits to the cumulative degree distributions P(> k) for etc = 1, N = fOOO, and 
A = 0.5 (top), A = f (middle), A = 2 (bottom). The fit-quality is given by the x 2 value per degree of freedom. 

The fitted values for the nonextensivity index q c and the characteristic degree k c are shown in Fig. [4] over the 
parameter space. From top to bottom three values of A are shown. The q c index is declining in all three parameters, 
otA-, OLM-, and A. It eventually converges to a plateau in the a. a — QfM-plane. The height of the plateau slowly decreases 
with higher A, but remains above 1; q c = 1 corresponds to the exponential (ER) case. For low aju there is a maximum 
of k c at about a a ~ 3; For larger au a plateau is forming for all a a- This plateau remains constant as a function of 
A. The quality of the g-exponential fit is demonstrated by the x 2 test statistics per degree of freedom. 

As in [5| we observe a finite size effect in the data. In Fig. [5] (a) we show the dependence of the degree distribution 
parameters as a function of au for different system sizes for a fixed a a = 5, and A = 2. The fits for k c are shown in 

Fig.[H(b). 

We now turn to the clustering and neighbor connectivity of the emerging networks. In Fig. [S]we show the clustering 
coefficient c and the average neighbor connectivity k nn as a function of k. For both quantities, the functional form of 
the decline with k is well fit with a 2-parameter exponential fit, exp(— e\ k + £2). 

In Fig. [7] we show the fit parameters t\ for c(fc), (a), and k nn (k), (b), for A = 0.5. For larger A the clustering 
coefficients become drastically smaller, as expected for the A — > 00 and cxa —> limit. Fits for cva > 5 and ajvf > 5 
become increasingly noisy and are omitted from the figure. 

In Fig. [8] we compare the global clustering coefficients from our model, with those obtained from a random graph 
with the same dimensions (same number of nodes and links). For the Erdos-Renyi random graph the clustering 
coefficient is C ran d = (k)/N — 1. The comparison makes clear that there is almost no attachment effects for a a > 3 
(i.e., negligible dependence from a a), and a strong dependence on olm and A, as expected. 



8 



3 




FIG. 5: (a) q c values of 3 system sizes for A = 2, oa = 5, and olm ranging from to 5. (b) same for k c . (c) and (d) show 
the same parameters as a function of network size N, for A = 1, a a = au = 0. For these parameters networks up to a size of 
N = 20000 were possible. 

V. DISCUSSION 

We have introduced a general network formation model which is able to recover, as particular instances, a large class 
of known network types. We checked that, to a very good approximation, the resulting degree distributions exhibit 
g-exponential forms, with q > 1. While a full theory of how complex networks are connected to q ^ 1 statistical 
mechanics is still missing, we provide further evidence that such a relation does indeed exist. For example, if we 
associate a finite fixed energy or "cost" to every bond, and associate with each node half of the energy corresponding 
to its bonds (the other half corresponding to the other nodes linked by those same bonds) , then the degree distribution 
can be seen as an energy distribution of the type emerging within nonextensive statistical mechanics. It might well be 
that the full understanding of this relation arises from the discrete nature of networks. The importance of appropriate 
values of q ^ 1 for systems 'living' in topologies with a vanishing Lebesgue measure has been pointed out before (2j. 
This possibly makes phase space for certain nonextensive systems look like a network itself. In this view the basis of 
nonextensive systems could be related to a network-like structure of their 'phase space', explaining the ubiquity of 
g-exponential distribution functions in the world of networks. 

Let us end by pointing out that, in variance with frequent such statements in the literature, the present model 
neatly illustrates that never ending growth is not necessary for having networks that are (asymptotically) scale-free. 
Indeed, g-exponential degree distributions do emerge for large enough networks which do not necessarily keep growing. 
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FIG. 7: Exponential decay constants ei for c(k) (a) and k nn (k) (b) over a a and cum for A = 0.5. The fit range was k € [1, 100] and 
averages over 100 independent configurations have been taken. Fits for a a > 5 and clm > 5 become statistically insignificant. 
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